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<110> APPLICANT-: Hofmann, Kathryn J. 

Jansen, Kathrin U. 

Neeper, Michael P. 
<120> TITLE OF INVENTION: DNA" ENCODING HUMAN PAPILLOMAVIRUS TYPE 

18 

<130> FILE REFERENCE: 19424PC 

<140> CURRENT APPLICATION NUMBER: 08/913,644 
<141> CURRENT FILING DATE: 1997-11-21 

<150> PRIOR APPLICATION NUMBER: PCT/US96/03 64 9 
<151> PRIOR FILING DATE: 1996-03-18 
<150> PRIOR APPLICATION NUMBER: 08/408,669 
<151> PRIOR FILING DATE: 1995-03-22 
<150> PRIOR APPLICATION NUMBER: 08/409,122 
<151> PRIOR FILING DATE: 1995-03-22 
<160> NUMBER OF SEQ ID NOS : 16 

<170> SOFTWARE: FastSEQ for Windows Version 4.0 
<210> SEQ ID NO: 1 
<211> LENGTH: 1524 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: HPV18 LI Consensus Sequence 
<400> SEQUENCE: 1 

atggctttgt ggcggcctag tgacaatacc gtataccttc cacctccttc tgtggcaaga 
gttgtaaata ctgatgatta tgtgactcgc acaagcatat tttatcatgc tggcagctct 



T 



60 
120 



agattattaa ctgttggtaa tccatatttt agggttcctg caggtggtgg caataagcag 180 
gatattccta aggtttctgc ataccaatat agagtatttc gggtgcagtt acctgaccca 240 
aataaatttg gtttacctga taatagtatt tataatcctg aaacacaacg tttagtgtgg 300 
gcctgtgctg gagtggaaat tggccgtggt cagcctttag gtgttggcct tagtgggcat 360 
ccattttata ataaattaga tgacactgaa agttcccatg ccgctacgtc taatgtttct 420 
gaggacgtta gggacaatgt gtctgtagat tataagcaga cacagttatg tattttgggc 480 
tgtgcccctg ctattgggga acactgggct aaaggcactg cttgtaaatc gcgtccttta 540 
tcacagggcg attgcccccc tttagaactt aagaacacag ttttggaaga tggtgatatg 600 
gtagatactg gatatggtgc catggacttt agtacattgc aagatactaa atgtgaggta 660 
ccattggata tttgtcagtc tatttgtaaa tatcctgatt atttacaaat gtctgcagat 720 
ccttatgggg attccatgtt tttttgctta cgacgtgagc agctttttgc taggcatttt 780 
tggaataggg caggtactat gggtgacact gtgcctcaat ccttatatat taaaggcaca 840 
ggtatgcgtg cttcacctgg cagctgtgtg tattctccct ctccaagtgg ctctattgtt 900 
acctctgact cccagttgtt taataaacca tattggttac ataaggcaca gggtcataac 960 
aatggtatct gctggcataa tcaattattt gttactgtgg tagataccac tcgtagtacc 1020 
aatttaacaa tatgtgcttc tacacagtct cctgtacctg ggcaatatga tgctaccaaa . 1080 
tttaagcagt atagcagaca tgttgaagaa tatgatttgc agtttatttt tcagttatgt 1140 
actattactt taactgcaga tgttatgtcc tatattcata gtatgaatag cagtatttta 1200 
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58 gaggattgga actttggtgt tccccccccg ccaactacta gtttggtgga tacatatcgt 1260 

59 tttgtacaat ctgttgctat tacctgt'caa aaggatgctg caccagctga aaataaggat 1320 

60 ccctatgata agttaaagtt ttggaatgtg gatttaaagg aaaagttttc tttggactta 1380 

61 gatcaatatc cccttggacg taaatttttg gttcaggctg gattgcgtcg caagcccacc 1440 

62 ataggccctc gtaaacgttc tgctccatct gccactacgt cttctaaacc tgccaagcgt 1500 

63 gtgcgtgtac gtgccaggaa gtaa 1524 

65 <210> SEQ ID NO: 2 

66 <211> LENGTH: 507 

67 <212> TYPE: PRT 

68 <213> ORGANISM: Artificial Sequence 

70 <220> FEATURE: 

71 <223> OTHER INFORMATION: HPV18 LI Consensus Sequence 



"7 "3 


<400> SEQUENCE: 


Z 
























1 A 


ixie l 


M.± a. 


Leu 


irp 


Arg 


Pro 


Ser 


Asp 


Asn 


I nr 


val 


Tyr 


Leu 


Pro 


Pro 


Pro 


/ D 


1 








c 
D 










1 n 
X u 










15 




76 


Ser 


Val 


Ala 


Arg 


Val 


Val 


Asn 


Thr 


Asp 


Asp 


Tyr 


Val 


Thr 


Arg 


Thr 


Ser 


77 








20 










25 










30 






78 


He 


Phe 


Tyr 


His 


Ala 


Gly 


Ser 


Ser 


Arg 


Leu 


Leu 


Thr 


Val 


Gly Asn 


Pro 


79 






35 










40 










45 








80 


Tyr 


Phe 


Arg 


Val 


Pro 


Ala 


Gly 


Gly 


Gly 


Asn 


Lys 


Gin 


Asp 


lie 


Pro 


Lys 


81 




50 










55 










60 










82 


Val 


Ser 


Ala 


Tyr 


Gin 


Tyr 


Arg 


Val 


Phe 


Arg 


.Val 


Gin 


Leu 


Pro 


Asp 


Pro 


83 


65 










70 










75 










80 


84 


Asn 


Lys 


Phe 


Gly 


Leu 


Pro 


Asp 


Asn 


Ser 


He 


Tyr 


Asn 


Pro 


Glu 


Thr 


Gin 


85 










85 










90 










95 




86 


Arg 


Leu 


Val 


Trp 


Ala 


Cys 


Ala 


Gly Val 


Glu 


He 


Gly 


Arg 


Gly 


Gin 


Pro 


87 








100 










105 










110 






88 


Leu 


Gly 


Val 


Gly 


Leu 


Ser 


Gly 


His 


Pro 


Phe 


Tyr 


Asn 


Lys 


Leu 


Asp Asp 


89 






115 










120 










125 








90 


Thr 


Glu 


Ser 


Ser 


His 


Ala 


Ala 


Thr 


Ser 


Asn 


Val 


Ser 


Glu 


Asp 


Val 


Arg 


91 




130 










135 










140 










92 


Asp 


Asn 


Val 


Ser 


Val 


Asp 


Tyr 


Lys 


Gin 


Thr 


Gin 


Leu 


Cys 


He 


Leu 


Gly 


93 


145 










150 










155 










160 


94 


Cys 


Ala 


Pro 


Ala 


He 


Gly 


Glu 


His 


^Trp 


Ala 


Lys 


Gly 


Thr 


Ala 


Cys 


Lys 


95 










165 










170 










175 




96 


Ser 


Arg 


Pro 


Leu 


Ser 


Gin 


Gly 


Asp 


Cys 


Pro 


Pro 


Leu 


Glu 


Leu 


Lys 


Asn 


97 








180 










185 










190 






98 


Thr 


Val 


Leu 


Glu 


Asp 


Gly 


Asp 


Met 


Val 


Asp 


Thr 


Gly 


Tyr 


Gly Ala 


Met 


99 






195 










200 










205 









100 Asp Phe Ser Thr Leu Gin Asp Thr Lys Cys Glu Val Pro Leu Asp He 

101 210 215 220 

102 Cys Gin Ser He Cys Lys Tyr Pro Asp Tyr Leu Gin Met Ser Ala Asp 

103 225 230 235 240 

104 Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gin Leu Phe 

105 245 250 255 

106 Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro 

107 260 265 270 

108 Gin Ser Leu Tyr He Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser 

109 275 280 285 
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110 


Cys 


Val 


Tyr Ser 


Pro 


Ser 


Pro 


Ser 


Gly 


Ser 


He 


Val 


Thr 


Ser Asp 


Ser 


111 




290 








295 










300 










112 


Gin 


Leu 


Phe Asn 


Lys 


Pro 


Tvr 


Tro 


Leu 


His 


Lvs 


Ala 


Gin 


Gly 


His 


Asn 


113 


305 








310 










315 










320 


114 


Asn 


Glv 


He Cys 


Tro 

IT 


His 


Asn 


Gin 


Leu 


Phe 


Val 


Thr 


Val 


Val 


Asp 


Thr 


115 








325 










330 










335 




116 


Thr 


Arg 


Ser Thr 


Asn 


Leu 


Thr 


He 


Cys 


Ala 


Ser 


Thr 


Gin 


Ser 


Pro 


Val 


117 






.340 










345 










350 






118 


Pro 


Glv 


Gin Tyr 


Asp 


Ala 


Thr 


Lys 


Phe 


Lys 


Gin 


Tvr 


Ser 


Arq 


His 


Val 


119 






355 








360 










365 








120 


Glu 


Glu 


Tyr Asp 


Leu 


Gin 


Phe 


He 


Phe 


Gin 


Leu 


Cys 


Thr 


He 


Thr 


Leu 


121 




370 








375 










380 










122 


Thr 


Ala 


Asd Val 


Met 


Ser 


Tyr 


lie 


His 


Ser 


Met 


Asn 


'Ser 


Ser 


He 


Leu 


123 


385 








390 










395 










400 


124 


Glu 


Asp 


Trn Asn 


Phe 


Gly Val 


Pro 


Pro 


Pro 


Pro 


Thr 


Thr 


Ser 


Leu 


Val 


125 








405 










410 










415 




126 


Asp 


Thr 


Tyr Arg 


Phe 


Val 


Gin 


Ser 


Val 


Ala 


lie 


Thr 


Cys 


Gin 


Lys 


Asp 


127 






420 










425 










430 






128 


Ala 


Ala 


Pro Ala 


Glu 


Asn 


Lys 


Asp 


Pro 


Tyr 


Asp 


Lys 


Leu 


Lys 


Phe 


Trp 


129 






435 








440 










445 








130 


Asn 


Val 


Asp Leu 


Lys 


Glu 


Lys 


Phe 


Ser 


Leu 


Asp 


Leu 


Asp 


Gin 


Tyr 


Pro 


131 




450 








455 










4 60 










132 


Leu 


Gly 


Arg Lys 


Phe 


Leu 


Val 


Gin 


Ala 


Gly 


Leu 


Arg 


Arg 


Lys 


Pro 


Thr 


133 


465 








470 










475 










480 


134 


He 


Gly 


Pro Arg 


Lys 


Arg 


Ser 


Ala 


Pro 


Ser 


Ala 


Thr 


Thr 


Ser 


Ser 


Lys 


135 








485 










490 










495 




136 


Pro 


Ala 


Lys Arg Val 


Arg 


Val 


Arg 


Ala 


Arg 


Lys 













137 500 505 

140 <210> SEQ ID NO: 3 

141 <211> LENGTH: 1389 

142 <212> TYPE: DNA 

143 <213> ORGANISM: Artificial Sequence 
14 5 <220> FEATURE: 

14 6 <223> OTHER INFORMATION: HPV18 L2 Consensus Sequence 
148 <400> SEQUENCE: 3 

14 9 atggtatccc accgtgccgc acgacgcaaa cgggcttcgg tgactgactt atataaaaca 60 

150 tgtaaacaat ctggtacatg tccatctgat gttgttaata aggtagaggg caccacgtta 120 

151 gcagataaaa tattgcaatg gtcaagcctt ggtatatttt tgggtggact tggcataggt 180 

152 actggaagtg gtacaggggg tcgtacaggg tacattccat tgggtgggcg ttccaataca 24 0 

153 gttgtggatg tcggtcctac acgtcctcca gtggttattg aacctgtggg ccccacagac 300 

154 ccatctattg ttacattaat agaggactca agtgttgtta catcaggtgc acctaggcct 360 

155 acttttactg gcacgtctgg gtttgatata acatctgctg gtacaactac acctgcagtt 420 

156 ttggatatca caccttcgtc tacctctgtt tctatttcca caaccaattt taccaatcct 480 

157 gcattttctg atccgtccat tattgaagtt ccacaaactg gggaggtgtc aggtaatgta 540 

158 tttgttggta cccctacatc tggaacacat gggtatgaag aaataccttt acaaacattt 600 

159 gcttcttctg gtacggggga ggaacccatt agtagtaccc cattgcctac tgtgcggcgt 660 

160 gtagcaggtc cccgccttta cagtagggcc taccaacaag tgtctgtggc taaccctgag 720 

161 tttcttacac gtccatcctc tttaattacc tatgacaacc cggcctttga gcctgtggac 780 

162 actacattaa catttgagcc tcgtagtaat gttcctgatt cagattttat ggatattatc 840 
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163 cgtttacata ggcctgcttt aacatccagg cgtggtactg tgcgctttag tagattaggt 900 

164 caaagggcaa ctatgtttac ccgtagcggt acacaaatag gtgctagggt tcacttttat 960 

165 catgatataa gtcctattgc accctcccca gaatatattg aactgcagcc tttagtatct 1020 

166 gccacggagg acaatggctt gtttgatata tatgcagatg acatagaccc tgcaatgcct 1080 

167 gtaccatcgc gtcctactac ctcctctgca gtttctacat attcgcccac tatatcatct 1140 

168 gcctcttcct atagtaatgt aacggtccct ttaacctcct cttgggatgt gcctgtatac 1200 

169 acgggtcctg atattacatt accacctact acctctgtat ggcccattgt atcacccaca 1260 

170 gcccctgcct ctacacagta tattggtata catggtacac attattattt gtggccatta 1320 

171 tattatttta ttcctaaaaa gcgtaaacgt gttccctatt tttttgcaga tggctttgtg 1380 

172 gcggcctag 1389 

174 <210> SEQ ID NO: 4 

175 <211> LENGTH: 461 

176 <212> TYPE: PRT 

177 <213> ORGANISM: Artificial Sequence 
17 9 <220> FEATURE: 

180 <223> OTHER INFORMATION: HPV18 L2 Consensus Sequence 

182 <400> SEQUENCE: 4 

183 Met Val Ser His Arg Ala Ala Arg Arg Lys Arg Ala Ser Val Thr Asp 

184 1 5 10 15 

185 Leu Tyr Lys Thr Cys Lys Gin Ser Gly Thr Cys Pro Ser Asp Val Val 

186 20 25 30 

187 Asn Lys Val Glu Gly Thr Thr Leu Ala Asp Lys lie Leu Gin Trp Ser 

188 35 40 45 

189 Ser Leu Gly He Phe Leu Gly Gly Leu Gly He Gly Thr Gly Ser Gly 

190 50 55 60 

191 Thr Gly Gly Arg Thr Gly Tyr He Pro Leu Gly Gly Arg Ser Asn Thr 

192 65 70 75 80 

193 Val Val Asp Val Gly Pro Thr Arg Pro Pro Val Val He Glu Pro Val 

194 85 90 95 

195 Gly Pro Thr Asp Pro Ser He Val Thr Leu He Glu Asp Ser Ser Val 

196 100 105 110 

197 Val Thr Ser Gly Ala Pro Arg Pro Thr Phe Thr Gly Thr Ser Gly Phe 

198 115 120 125 

199 Asp He Thr Ser Ala Gly Thr Thr Thr Pro Ala Val Leu Asp lie Thr 

200 130 135 140 

201 Pro Ser Ser Thr Ser Val Ser He Ser Thr Thr Asn Phe Thr Asn Pro 

202 145 150 155 160 

203 Ala Phe Ser Asp Pro Ser He lie Glu Val Pro Gin Thr Gly Glu Val 

204 165 170 175 

205 Ser Gly Asn Val Phe Val Gly Thr Pro Thr Ser Gly Thr His Gly Tyr 

206 180 185 190 

207 Glu Glu He Pro Leu Gin Thr Phe Ala Ser Ser Gly Thr Gly Glu Glu 

208 195 200 205 

209 Pro He Ser Ser Thr Pro Leu Pro Thr Val Arg Arg Val Ala Gly Pro 

210 210 - 215 220 

211 Arg Leu Tyr Ser Arg Ala Tyr Gin Gin Val Ser Val Ala Asn Pro Glu 

212 225 230 235 240 

213 Phe Leu Thr Arg Pro Ser Ser Leu He Thr Tyr Asp Asn Pro Ala Phe 

214 245 250 255 
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215 


Glu 


Pro 


Val 


Asp 


Thr 


Thr 


Leu 


Thr 


Phe 


Glu 


Pro 


Arg 


Ser 


Asn 


Val 


Pro 


216 








260 










265 










270 






217 


Asp 


Ser 


Asp 


Phe 


Met 


Asp 


He 


He 


Arg 


Leu 


His 


Arg 


Pro 


Ala 


Leu 


Thr 


218 






275 










280 










285 








21 9 

Z 1 Zf 


Qpi -r 

OCi 


Arg 


Arg 


Gl u 


Thr 


Val 


Arg 


Phe 


Ser 


Arg 


Leu 


Gly Gin 


Arg 


Ala 


Thr 


990 

Z Z tJ 




290 

Z J V 




















0\J\J 










991 

z z i 


Mpr 


Phe 


Thr 


A r rr 


Qp r 


ftl V 


Thr 


Gin 


11C 


uxy 


Ala 


Arg 


Val 


His 


Phe 


i yx 


999 
z z z 


j \j j 










^10 

J1U 










31 S 










320 
j z u 


99^ 

Z Z J 


H-i c- 




Tip 
lie 


JCi 


Pm 
nu 


T 1 p 

1 1 C 


Alp 


Pro 


OCI 


Pro 


r;i n 


T\/r 

lyr 


lie 


U1U 


T,pn 


Gin 


99 & 
z z *± 










j 










O J v 










~J 




9 9^ 

Z Z ~J 


Pro 

IT X (J 


T .on 
lie u 


Vp 1 




Al ^ 

-riXcl 


i i ix 




7\qn 


no I i 


oiy 


T .ph 
Lie u. 


Phe 


Asp 


T1p 
-Lit; 


T\/r 
iyr 


Al 3 
1 a 


226 








340 










345 










350 






991 
z z / 




7\on 
nb p 


T1p 

± j_e 


■ 7\ on 
nbp 


ITX O 




lYlcX. 


rIO 


Vdl 


IrX vJ 


Qa v 


Arg 


Pro 


Th r 
X XIX 


Th r 
l ill 


Qpr 
Ocl 


999, 

z z o 






j j j 










360 

J VJ \J 










365 








zz. y 


Ser 


Ala 


vai 


Ser 


1 nr 


Tyr 


Ser 


Pro 


i nr 


lie 


Ser 


Ser 


Ala 


Ser 


Ser 


Tyr 


230 




370 










375 










380 










231 


Ser 


Asn 


Val 


Thr 


Val 


Pro 


Leu 


Thr 


Ser 


Ser 


Trp 


Asp 


Val 


Pro 


Val 


Tyr 


232 


385 










390 










395 










400 


233 


Thr 


Gly 


Pro 


Asp 


He 


Thr 


Leu 


Pro 


Pro 


Thr 


Ser 


Val 


Trp 


Pro 


He 


Val 


234 










405 










410 










415 




235 


Ser 


Pro 


Thr 


Ala 


Pro 


Ala 


Ser 


Thr 


Gin 


Tyr 


He 


Gly 


He 


His 


Gly 


Thr 


236 








420 










425 










430 






237 


His 


Tyr 


Tyr" 


Leu 


Trp 


Pro 


Leu 


Tyr 


Tyr 


Phe 


He 


Pro 


Lys 


Lys 


Arg 


Lys 


238 






435 










440 










445 








239 


Arg 


Val 


Pro 


Tyr 


Phe 


Phe 


Ala 


As P 


Gly 


Phe 


Val 


Ala 


Ala 








240 




450 










455 










460 











243 <210> SEQ ID NO: 5 

244 <211> LENGTH: 41 

245 <212> TYPE: DNA 

24 6 <213> ORGANISM: Artificial Sequence 

248 <220> FEATURE: 

249 <223> OTHER INFORMATION: oligonucleotide, sense primer 

251 <400> SEQUENCE: 5 

252 gaagatctca caaaacaaaa tggctttgtg gcggcctagt g 41 

254 <210> SEQ ID NO: 6 

255 <211> LENGTH: 36 

256 <212> TYPE: DNA 

257 <213> ORGANISM: Artificial Sequence 

259 <220> FEATURE: 

260 <223> OTHER INFORMATION: oligonucleotide, antisense primer 
2 62 <4 00> SEQUENCE: 6 

263 gaagatcttt acttcctggc acgtacacgc acaege 36 

265 <210> SEQ ID NO: 7 

266 <211> LENGTH: 45 

267 <212> TYPE: DNA 

268 <213> ORGANISM: Artificial Sequence 

270 <220> FEATURE: 

271 <223> OTHER INFORMATION: oligonucleotide, sense primer 
273 <400> SEQUENCE: 7 
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